A patent keywords extraction method using TextRank model with prior public knowledge
نویسندگان
چکیده
Abstract For large amount of patent texts, how to extract their keywords in an unsupervised way is a very important problem. In existing methods, only the own information texts analyzed. this study, improved TextRank model proposed, which prior public knowledge effectively utilized. Specifically, two following points are first considered: (1) network constructed for each text, (2) based on dictionary data, edges represent interpretation relationship among all words entries. Then, node rank value evaluation formula designed networks introduced. Finally, can be extracted by finding top-k with higher values. our experiments, text clustering task used examine performance proposed method, wherein several comparison experiments executed. Corresponding results demonstrate that, new method markedly obtain better than methods extraction way.
منابع مشابه
Improved Automatic Keyword Extraction Based on TextRank Using Domain Knowledge
Keyword extraction of scientific articles is beneficial for retrieving scientific articles of a certain topic and grasping the trend of academic development. For the task of keyword extraction for Chinese scientific articles, we adopt the framework of selecting keyword candidates by Document Frequency Accessor Variety(DF-AV) and running TextRank algorithm on a phrase network. To improve domain ...
متن کاملInteresting Patterns Extraction Using Prior Knowledge
One important challenge in data mining is to extract interesting knowledge and useful information for expert users. Since data mining algorithms extracts a huge quantity of patterns it is therefore necessary to filter out those patterns using various measures. This paper presents IMAK, a part-way interestingness measure between objective and subjective measure, which evaluates patterns consider...
متن کاملExtraction of Keywords of Novelties From Patent Claims
There are growing needs for patent analysis using Natural Language Processing (NLP)-based approaches. Although NLP-based approaches can extract various information from patents, there are very few approaches proposed to extract those parts what inventors regard as novel or having an inventive step compared to all existing works ever. To extract such parts is difficult even for human annotators ...
متن کاملUnsupervised Information Extraction with Distributional Prior Knowledge
We address the task of automatic discovery of information extraction template from a given text collection. Our approach clusters candidate slot fillers to identify meaningful template slots. We propose a generative model that incorporates distributional prior knowledge to help distribute candidates in a document into appropriate slots. Empirical results suggest that the proposed prior can brin...
متن کاملAspect Extraction with Automated Prior Knowledge Learning
Aspect extraction is an important task in sentiment analysis. Topic modeling is a popular method for the task. However, unsupervised topic models often generate incoherent aspects. To address the issue, several knowledge-based models have been proposed to incorporate prior knowledge provided by the user to guide modeling. In this paper, we take a major step forward and show that in the big data...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Complex & Intelligent Systems
سال: 2021
ISSN: ['2198-6053', '2199-4536']
DOI: https://doi.org/10.1007/s40747-021-00343-8